From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody
نویسندگان
چکیده
We hypothesize that various prosody output styles can be predicted and simulated from one default base form by accounting for contributions from higher level information to cross-phrase prosodic relationship. Speech materials of four prosody styles were selected: (1.) Han and Tang poetry, (2.) Tang Ballads and Song poetry, (3.) Qin, Tang and Song classic prose and (4.) contemporary TV weather forecast. F0 contours were analyzed using the Fujisaki model, while quantitative analyses of predictions from layered-andcumulative contribution specified by the HPG (Hierarchical Prosodic phrase Grouping) framework [Tseng et al, 2004; 2005; 2006] were performed across styles and speakers. Results confirmed that higher level contribution is significant across style; contribution distribution patterns and style specific; more regular prosodic formats require more contribution from higher level; stylistic dynamics are predictable; and the HPG base form is indeed default.
منابع مشابه
A Computational Memory and Processing Model for Processing
This paper links prosody to the information in the text and how it is processed by the speaker. It describes the operation and output of Loq, a text-to-speech implementation that includes a model of limited attention and working memory. Attentional limitations are key. Varying the attentional parameter in the simulations varies in turn what counts as given and new in a text, and therefore, the ...
متن کاملPredicting Prosody from Text
In order to improve unlimited TTS, a framework to organize the multiple perceived units into discourse is proposed in [1]. To make an unlimited TTS system, we must transform the original text to the text with corresponding boundary breaks. So we describe how we predicate prosody from Text in this paper. We use the corpora with boundary breaks which follow the prosody framework. Then we use the ...
متن کاملGenerating pitch accent distributions that show individual and stylistic differences
I describe a limited-resource approach to generating prosody that mediates text-based information through a model of attention and working memory, whose simulation parameters are quantitative. The main parameter quanties recall. Varying it varies what counts as given and new in a text, and therefore, the pitch accents with which the text is uttered. Currently, the system produces prosody in thr...
متن کاملA Multi-perspective Approach to the Dynamics of Real-time Prosody
Much of what is known about prosody derives from clinical studies of adults with hemispheric lesions. Moreover, prosodic abnormalities tend to be interpreted with little attention to speech planning difficulties. This investigation describes a model of discourse and speech planning that utilizes an integrated methodology, incorporating neuroanatomical, discourse, and acoustic-physiological doma...
متن کامل